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ADAPTIVE MODULATION FOR MULTI-ANTENNA TRANSMISSIONS WITH 

PARTIAL CHANNEL KNOWLEDGE 

[0001] This application claims priority from U.S. Provisional Application Serial No. 
60/499,754, filed September 3, 2003, the entire content of which is incorporated herein by 
reference. 

TECHNICAL FIELD 
[0002] The invention relates to wireless communication and, more particularly, to coding 
techniques for multi-antenna transmitters. 

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR 

DEVELOPMENT 

[0003] This invention was made with Government support under Contract Nos. CCR- 
0105612, awarded by the National Science Foundation, and Contract No. DAAD 19-0 1-2- 
001 1 (Telcordia Technologies, Inc.) awarded by the U.S. Army. The Government may have 
certain rights in this invention. 

BACKGROUND 

[0004] By matching transmitter parameters to time varying channel conditions, adaptive 
modulation can increase the transmission rate considerably, which justifies its popularity for 
future high-rate wireless applications. The adaptive modulation makes use of channel state 
information (CSI) at the transmitter, which may be obtained through a feedback channel. 
Adaptive designs assuming perfect CSI work well only when CSI imperfections induced by 
channel estimation errors and/or feedback delays are limited. For example, an adaptive 
system with delayed error-free feedback should maintain a feedback delay x <0.01//d, where 
/d denotes the Doppler frequency. Such stringent constraint is hard to ensure in practice, 
unless channel fading is sufficiently slow. However, long range channel predictors relax this 
delay constraint considerably. An alternative approach is to account for CSI imperfections 
explicitly, when designing the adaptive modulator. 
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[0005] On the other hand, antenna diversity has been established as an effective fading 
counter measure for wireless applications. Due to size and cost limitations, mobile units can 
typically only afford one or two antennas, which motivates multiple transmit-antennas at the 
base station. With either perfect or partial CSI at the transmitter, the capacity and 
performance of multi-antenna transmissions can be further improved. 
[0006] Adaptive modulation has the potential to increase the system throughput significantly 
by matching transmitter parameters to time-varying channel conditions. However, adaptive 
modulation schemes that rely on perfect channel state information (CSI) are sensitive to CSI 
imperfections induced by estimation errors and feedback delays. 

[0007] Moreover as symbol rates increase in broadband wireless applications, the underlying 
Multi-Input Multi-Output (MIMO) channels exhibit strong frequency-selectivity. By 
transforming frequency-selective channels to an equivalent set of frequency- flat sub- 
channels, orthogonal frequency division multiplexing (OFDM) has emerged as an attractive 
transmission modality, because it comes with low-complexity (de)modulation, equalization, 
and decoding, to mitigate frequency-selective fading effects. One challenge for an adaptive 
MIMO-OFDM transmissions involves determining whether and what type of CSI can be 
made practically available to the transmitter in a wireless setting where fading channels are 
randomly varying. 

SUMMARY 

[0008] In general, the invention is directed to adaptive modulation schemes for multi-antenna 
transmissions with partial channel knowledge. The techniques are first described in 
reference to single-carrier, flat- fading channels. The techniques are then extended to multi- 
carrier, frequency-fading channels. 

[0009] In particular, a transmitter is described that includes a two-dimensional beamformer 
where Alamouti coded data streams are power loaded and transmitted along two orthogonal 
basis beams. The transmitter adjusts the basis beams, the power allocation between two 
beams, and the signal constellation, to improve, e.g., maximize, the system throughput while 
maintaining a prescribed bit error rate (BER). Adaptive trellis coded modulation may also be 
used to further increase the transmission rate. 
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[0010] The described adaptive multi-antenna modulation schemes are less sensitive to 
channel imperfections compared to single-antenna counterparts. In order to achieve the same 
transmission rate, an interesting tradeoff emerges between feedback quality and hardware 
complexity. As an example, the rate achieved by on transmit antenna when /a x <0.01 can be 
provided by two transmit antennas, but with a relaxed feedback delay /a x = 0.1, representing 
an order of magnitude improvement. 

[0011] Next, a partial CSI model for orthogonal frequency division multiplexed (OFDM) 
transmissions over multi-input multi-output (MIMO) frequency selective fading channels is 
described. In particular, this disclosure describes an adaptive MIMO-OFDM transmitter in 
which the adaptive two-dimensional coder-beamformer is applied on each OFDM subcarrier, 
along with an adaptive power and bit loading scheme across OFDM subcarriers. By making 
use of the available partial CSI at the transmitter, the transmission rate may be increased or 
maximized while guaranteeing a prescribed error performance under the constraint of fixed 
transmit-power. Numerical results confirm that the adaptive two-dimensional space-time 
coder-beamformer (with two basis beams as the two "strongest" eigenvectors of the 
channel's correlation matrix perceived at the transmitter) combined with adaptive OFDM 
(power and bit loaded with M-ary QAM constellations) improves the transmission rate 
considerably. 

[0012] In one embodiment, the invention is directed to a wireless communication device 
comprising a constellation selector, a beamformer, and a plurality of transmit antennas. The 
constellation selector adaptively selects a signal constellation from a set of constellations 
based on channel state information for a wireless communication channel, wherein the 
constellation selector maps information bits of an outbound data stream to symbols drawn 
from the selected constellation to produce a stream of symbols. The beamformer generates a 
plurality of coded data streams from the stream of symbols. The plurality of transmit 
antennas output waveforms in accordance with the plurality of coded data streams. 
[0013] In another embodiment, the invention is directed to a wireless communication device 
comprising a plurality of adaptive modulators that each comprises: (i) a constellation selector 
that adaptively selects a signal constellation from a set of constellations based on channel 
state information for a wireless communication channel, wherein the constellation selector 
maps the respective information bits to symbols drawn from the selected constellation to 
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produce a stream of symbols, and (ii) a beamformer that generates a plurality of coded data 
streams from the stream of symbols. The wireless communication device further comprises a 
modulator to produce a multi-carrier output waveform in accordance with the plurality of 
coded data streams for transmission through the wireless communication channel. 
[0014] In another embodiment, the invention is directed to a method comprising receiving 
channel state information for a wireless communication system, adaptively selecting a signal 
constellation from a set of constellations based on the channel state information, and coding 
signals for transmission by a multiple antenna transmitter based on the estimated channel 
information and the selected constellation. 

[0015] In another embodiment, the invention is directed to a computer-readable medium 
comprising instructions. The instructions cause a programmable processor to receive channel 
state information for a wireless communication system, and select a signal constellation from 
a set of constellations based on the channel state information. The instructions further cause 
the processor to map information bits of an outbound data stream to symbols drawn from the 
selected constellation to produce a stream of symbols, and apply an eigen-beamformer to 
generate a plurality of coded data streams from the stream of symbols to produce a plurality 
of coded signals. 

[0016] The details of one or more embodiments of the invention are set forth in the 
accompanying drawings and the description below. Other features, objects, and advantages 
of the invention will be apparent from the description and drawings, and from the claims. 

BRIEF DESCRIPTION OF DRAWINGS 

[0017] FIG. 1 is a graph that compares the exact bit error rates (BERs) evaluated against the 
approximate BERs for QAM constellations 

[0018] FIG. 2 is a block diagram illustrating a wireless communication system with N t 
transmit-and N r receive-antennas. 

[0019] FIG. 3 is a block diagram illustrating a two-dimensional (2D) beamformer upon 
which the adaptive multi-antenna transmitter described herein is based. 
[0020] FIG. 4 is a graphic that plots the optimal regions for different signal constellations 
[0021] FIG. 5 is a graph that plots the simulated BER and the approximate BER 



4 



Docket No.: 1008-022US01 /Z04015 

[0022] FIG. 6 is a graph that plots one possible error path in adaptive trellis code modulation 
for 8-state trellis codes. 

[0023] FIG. 7 plots the rate achieved by the adaptive transmitter. 

[0024] FIG. 8 is a plot that illustrates an achieved transmission rate for a system having a 
single receive antenna. 

[0025] FIG. 9 is a plot that illustrates a tradeoff between feedback delay and hardware 
complexity. 

[0026] FIG. 10 is a plot that illustrates an achieved rate improvement with trellis coded 
modulation (TCM). 

[0027] FIG. 1 1 is a plot that illustrates an impact of receive diversity on the adaptive TCM 
techniques. 

[0028] FIG. 12 is a block diagram depicting an equivalent discrete-time baseband model of 

an OFDM wireless communication system. 

[0029] FIG. 13 is plot that illustrates certain thresholds. 

[0030] FIG. 14 is a plot that illustrates a power loading snapshot for certain channel 
realizations. 

[0031] FIG. 15 is a plot illustrating certain threshold distances. 

[0032] FIG. 16 is a plot illustrating a bit loading snapshot for certain channel realizations. 
[0033] FIGS. 17-19 are plots that illustrate certain rate comparisons. 

DETAILED DESCRIPTION 

[0034] This disclosure first presents a unifying approximation to bit error rate (BER) for Al- 
ary quadrature amplitude modulation (M-QAM). Gray mapping from bits to symbols is 
assumed. In order to facilitate adaptive modulation, approximate BERs, that are very simple 
to compute, are particularly attractive. In addition to square QAMs with M= 2 2 \ rectangular 
QAMs with M= 2 2 ' +1 are considered. For exemplary purposes, the disclosure focuses on 
rectangular QAMs that can be implemented with two independent pulse-amplitude- 
modulations (PAMs): one on the In-Phase branch with size , and the other on the 
Quadrature-phase branch with size VM/2. 
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[0035] Consider a non-fading channel with additive white Gaussian noise (AWGN), having 
variance No/2 per real and imaginary dimension. For a constellation with average energy E*, 
let do :=min(|s-s '|) be its minimum Euclidean distance. For each constellation, define a 
constant g as: 

3 

g = for square M-Q AM (1) 

2(M-1) 

g = — - — for rectangular M-QAM. (2) 
5M -4 

The symbol energy E s is then related to ^ through the identity: 



(3) 

The following unifying BER approximation for all QAM constellations can be 



adopted: 



P b * 0.2 exp 



|2 \ 



4N0 



(4) 



which can be re-expressed as: 

( gEs) 



P b » 0.2 exp 



NO J 



(5) 



[0036] BPSK is a special case of rectangular QAM with M= 2, corresponding to g = 1 . 
Hence, no special treatment is needed for BPSK. We next verify the approximate BER. 
[0037] FIG. 1 is a graph that compares the exact BERs evaluated against the approximate 
BERs for QAM constellations with M = 2\ i e [1,8]. The approximation is within two dBs, 
for all constellations at ?t> < 10" 2 , as confirmed by FIG. 1. 

[0038] FIG. 2 is a block diagram illustrating a wireless communication system with N t 
transmit-and Af r receive-antennas. Focusing on flat fading channels, let h MV denote the 
channel coefficient between the jith transmit- and the vth receive- antenna, where ^ e [1,M] 
and v e [lyN T ]. Channel coefficients may be collected in an N t x N r channel matrix H having 
(fi, v)th entry h^. For each receive antenna v, the channel vector h v :=[h\ v , . . h^tv] T is 
defined. 
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[0039] The wireless channels are slowly time- varying. The receiver obtains instantaneous 
channel estimates, and feeds the channel estimates back to the transmitter regularly. Based 
on the available channel knowledge, the transmitter adjusts its transmission to improve the 
performance, and increase the overall system throughput. The disclosure next specifies an 
exemplary channel feedback setup, and develops an adaptive multi-antenna transmission 
structure. 

Channel Mean Feedback 

[0040] For exemplary purposes, the disclosure focuses on channel mean feedback, where 
spatial fading channels are modeled as Gaussian random variables with non-zero mean and 
white covariance conditioned on the feedback. Specifically, an assumption may be adopted 
that transmitter x models channels x as: 



(H,cr * J parameterizes the partial CSI, which is updated regularly given feedback information 
from the receiver. 

[0041] The partial CSI parameters (h,ct^ ) can be provided in many different ways. For 
illustration purposes, a specific application scenario with delayed channel feedback is 
explored and used in our simulations. 

[0042] With regard to delayed channel feedback, it can be assumed that: i) the channel 
coefficients {h^} N t N r 



are independent and identically distributed with Gaussian distribution CN\0,cjI j; ii) the 

channels are slowly time varying according to Jakes' model with Doppler frequency fd\ and 
iii) the channels are acquired perfectly at the receiver and are fed back to the transmitter with 
delay x, but without errors. Perfect channel estimation at the receiver (with infinite 
quantization resolution), and error- free feedback, which can be approximated by using error- 
free control coding and ARQ protocol in feedback channel feedback H/- is drawn from the 
same Gaussian process as H, but in t seconds ahead of H. The corresponding entries of Rf 



H = H + H, 



(6) 



where H is the conditional mean of H given feedback information, and 

~ CN{p N xNr N r G 2 E I Nf ) is the associated zero-mean error matrix. The deterministic pair 
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and H are then jointly zero-mean Gaussian, with correlation coefficient p :-Jo(27i/^r) 
specified from the Jakes' model, where Jo( ) is the zeroth order Bessel function of the first 
kind. For each realization of H/, the parameters needed in the mean feedback model of (6) 
are obtained as: 

H = E^|Hj=PHr o>G,k-\p\ 2 ) (7) 

Adaptive Two Dimensional Transmit-Beamforming 

[0043] FIG. 3 is a block diagram illustrating a two-dimensional (2D) beamformer upon 
which the adaptive multi-antenna transmitter described herein is based. Depending on 
channel feedback, the information bits will be mapped to symbols drawn from a suitable 
constellation. The symbol stream s(n) will then be fed to the 2D beamformer, and 
transmitted through N t antennas. The 2D beamformer uses the Alamouti code to generate 
two data streams s \ (n) from the original symbol stream s(n) as follows: 

~s : (2n) $,(2/2 + 1)" 
s 2 (ln) s 2 (ln + \) 

The total transmission power E s is allocated to these streams: 5iE s to s \ («), and 52E S = (1 - 
5i)E s to s 2 (n\ where 5i e [0,1]. Each power-loaded symbol stream is weighted by an A^x 1 
beam-steering vector X(n) := \x x (/2 ),...., Xn, at ^ e n ^ ti me s ^ ot * s: 

X(n) = J, (n^u* + s 2 (n^u 2 ) (9) 

[0044] Moving from single to multiple transmit-antennas, a number of spatial multiplexing 
and space time coding options are possible, at least when no CSI is available at the 
transmitter. An adaptive transmitter based on a 2D beamforming approach may be 
advantageous for a number of reasons. 

[0045] For example, based on channel mean feedback, the optimal transmission strategy (in 
the uncoded case) is to combine beamforming (with N t >2 beams) with orthogonal space 
time block coding (STBC), where the optimality pertains to an upper-bound on the pairwise 
error probability, or an upper-bound on the symbol error rate. However, orthogonal STBC 
loses rate when > 2, which is not appealing for adaptive modulation whose ultimate goal is 
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to increase the data rate given a target BER performance. On the other hand, the 2D 
beamformer can achieve the best possible performance when the channel feedback quality 
improves. Furthermore, the 2D beamformer is suboptimal only at very high SNR. In such 
cases, the achieved BER is already below the target, rendering further effort on BER 
improvement by sacrificing the rate unnecessary. In a nutshell, the 2D beamformer is 
preferred because of its full-rate property, and its robust performance across the practical 
SNR range. 

[0046] In addition, the 2D beamformer structure is general enough to include existing 
adaptive multi-antenna approaches; e.g., the special case of (N h N r ) = (2, 1) with perfect CSI 
considered. To verify this, the channels can be denoted as h\ and A2. Setting (81, 82) = (1,0), 
uj = [1,0] T when \h\\ > IA2I and ui = [0,1] T otherwise, our 2D beamformer reduces to the 
selective transmitter diversity (STD) scheme. Setting (81, 8 2 ) = (1,0) and ui = [h u h 2 ] T I 

^|A,| 2 +|A 2 | 2 , our 2D beamformer reduces to the transmit adaptive array (TxAA) scheme. 

Finally, setting (81, 82) = (1/2, V2), m = [1 ,0] T and u 2 = [0,1] T leads to the space time transmit 
diversity (STTD) scheme. 

[0047] Moreover, due at least in part to the Alamouti structure, improved receiver processing 
can readily be achieved. The received symbol y y (n) on the vth antenna is: 
y v {n)=x T (n)h ¥ +w v (n) 

= ^(«X/^" Wl H /2 v +^n\fcu?h v +w v (4 (10) 
where w v (n) is the additive white noise with variance No/2 per real and imaginary 
dimension. Eq. (10) suggests that the receiver only observes two virtual transmit antennas, 
transmitting s\(n) and si(n) , respectively. The equivalent channel coefficient from the jth 

virtual transmit antenna to the vth receive-antenna is ^fs~u^h v Supposing that the channels 

remain constant at least over two symbols, the linear maximum ratio combiner (MRC) is 
directly applicable to our receiver, ensuring maximum likelihood optimality. Symbol 
detection is performed separately for each symbol; and each symbol is equivalently passing 
through a scalar channel with 
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(11) 



where w(n) has variance No/2 per dimension. The transmitter influences the quality of the 
equivalent scalar channel heq V through the 2D beamformer adaptation of (81, 82, ui, 112). 
[0048] As yet another advantage, the combination of Alamouti's coding and transmit- 
beamforming may be advantages in view of emerging standards. 



Adaptive Modulation Based on 2D Beamforminz 

[0049] Returning to FIG. 2, based on mean feedback, transmitter 4 controls eigen- 
beamformer x to adjust the basis beams (ui and U2), the power allocation (81 and 82), and the 
signal constellation of size M and energy E s , to maximize the transmission rate while 
maintaining the target BER:Pb,target- For purposes of illustration, QAM constellations are 
adopted, N different QAM constellations with M t = 2\ where i = 1, 2, . . ., N, as those 
exemplified above, are assumed. Correspondingly, the constellation-specific constant g can 
be denoted as g/. The value of g/ is evaluated from (1), or (2), depending on the constellation 
M^ When the channel experiences deep fades, the adaptive design may be allowed to 
suspend data transmission (this will correspond to Mo = 0). 

[0050] Under these assumptions, transmitter 4 perceives a random channel matrix H as in 
(6). The BER for each realization of H is obtained from (1 1) and (5) as: 



P 6 (//,M,)*0.2exp 



eqv N 



(12) 



Since the realization of H is not available, the transmitter relies on the average BER: 



? b {Mi ) = E{P, (H, M f )} * 0.2Ej exp 



'eqv 



N 



(13) 



0 / 



and uses P& (M, ) as a performance metric to select a constellation of size M t . 



Let the eigen decomposition of HH be: 
HH H =U H D H U%, £> H := diag(\,X 2 ,...,^ t ) 



(14) 
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channel H in (6), we term them as eigen-directions, or, eigen-beams. 

[0051] For any power allocation with 5, > 8 2 > 0 the optimal ui and U2 minimizing P^(M / ) 

can be expressed as: 



In other words, the optimal basis beams for our 2D beamformer are eigen-beams 
corresponding to the two largest eigenvalues X\ and A^. Hereinafter, the adaptive 2D 
beamformer is referred to as a 2D eigen-beamformer. 

Adaptive Power Allocation between Two Beams 

[0052] With the optimal eigen-beams, the average BER can be obtained similarly, but with 
only two virtual antennas. Formally, the expected BER is: 



For a given p f , the optimal power allocation that minimizes (16) can be found in closed-form, 
following derivations. Specifically, with two virtual antennas, we simplify to: 



u, =u 



(15) 




where for notational brevity, we define 



(17) 




(18) 



where <5 2 ° is obtained from: 
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(Ny £ +2X l lN r <j 2 E +^) 2 



(19) 



1 + 



The optimal solution guarantees that 5i > 82 > 0; thus, more power is allocated to the stronger 
ei gen-beam. If two eigen-beams are equally important (k\ = X2), the optimal solution is 5i = 
5 2 = Vi. On the other hand, if the channel feedback quality improves as a] -> 0 , 5i and 5 2 
are constellation dependent. 

Adaptive Rate Selection with Constant Power 

[0053] With perfect CSI, using the probability density function (p.d.f.) of the channel fading 
amplitude, the optimal rate and power allocation for single antenna transmissions has been 
provided. Optimal rate and power allocation for the multi-antenna transmission described 
herein with imperfect CSI turns out to be much more complicated. Constant power 
transmission can be, therefore, focused on, and only the modulation level is adjusted. 
Constant power transmission simplifies the transmitter design, and obviates the need for 
knowing the channel p.d.f.. 

[0054] With fixed transmission power and a given constellation, transmitter 4 computes the 
expected BER with optimal power splitting in two eigen-beams, per channel feedback. The 
transmitter then chooses the rate-maximizing constellation, while maintaining the target 
BER. Since the BER performance decreases monotonically with the constellation size, the 
transmitter finds the optimal constellation to be: 



This equation can be solved by trial and error; starting with the largest constellation A/, = M N , 
and then decreasing i until the optimal M t is found. 

[0055] Although there are N t N r entries in H, constellation selection depends only on the first 
two ei gen- values X\ and Aa- The two dimensional space of (kifa) can be split in N+ 1 




(20) 



disjoint regions }" 0 each associated with one constellation. Specifically, 
M=M n when (A, 5 A 2 )e D f ,Vz = 0,1,...,^ (21) 
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can be chosen. The rate achieved by system 2 of FIG. 2 is therefore 

/? = 2log 2 (M,)JJ 0( p(A 1> A 2 >/V^. (22) 

where p(k\ y X2) is the joint p.d.f. of X\ and X2. The outage probability is thus: 

P~=JJ**<V^V*2. (23) 

The fading regions can be specified. Since X2 = X\ 9 we have a := A 2 / A, e [0,l] 

To specify the region A in the A^) space, the intersection of A with each straight line can 

be specified as X2 = aX\ where a e [0,l] . Specifically, the fading region D t on each line will 

reduce to an interval. This interval on the line X2 = aX\ will be denoted as \a i {a\a + l(a)), 

during which the constellation M t is chosen. In addition, a 0 (a ) = 0 and a N+l (a) = 00 . The 

boundary points {a, (a)}", remain to be specified. 

[0056] For a given constellation M, and power allocation factors (S x ,<5 2 = 1 - 5! ) the 
minimum value of X\ on the line of ^2 = aX\ can be determined so that P*> (M . ) < ¥ bJarget as: 



8 X P, ^ ad^ N_1 



^1 + 5^,. 1 + <5 2 0J 



0.2 



lp* w [(i+*iAXi+*2A)rj 



(24) 



Since the optimal 5, e [l/2,l]will lead to the minimal A,i that satisfies the BER requirement, 
the boundary point a s (a) can be found as: 

«,W=S./2,,]A 1 («.5„M j ) (25) 

The minimization is a one-dimensional search, and it can be carried out numerically. Having 
specified the boundaries on each line, the fading regions associated with each constellation in 
the two dimensional space can be plotted, as illustrate in further detail below. 
[0057] In the general multi-input multi-output (MIMO) case, each constellation M t is 
associated with a fading region D, on the two dimensional plane (k u A*). Several special 
cases exist, where the fading region is effectively determined by fading intervals on the first 
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eigenvalue A, i. In such cases, the boundary points are denoted as {si}"^- The constellation 
Mi is chosen when \ e [a , , a ,+i ) The following may then be obtained: 

/? = flog 2 (A/ I .)£ 4, p(A 1 >/A 1 

1=1 

= flog 2 (M,.)[F(a (+1 )-F(a,)] 



(26) 



. 1=1 



where F(x):= \* p{X^dXy is the cumulative distribution function (c.d.f) of X\. The outage 
Jo 

becomes: 



(27) 



To calculate the rate and outage, it suffices to determine the p.d.f. of X\ 9 and the boundaries 

i }"i • F° r multiple transmit - and a single receive - antennas, N r =l, and there is only one 
non-zero eigen- value X\ 9 and thus a = X2/A-1 = 0. The boundary points are: 
at =a,(0) V/ = 0,l,...,tf (28) 
where a, (a) is specified in (25). 

[0058] When N r =\, the channel hi is distributed as CN(o, l N ) . With delayed feedback 
considered in Example 2, we have A, = |p| 2 )|A,|| 2 = IpfXjlJ^il w ^ c ^ * s Gamma 
distributed with parameter N t and mean } = |p| 2 N t The p.d.f and c.d.f. of X\ are: 



f ^ Ar ' -n.-i f 



K' 

7—! r-exp 



(29) 



= \-e 



( V 



W J 



(30) 



Plugging (30) and (28) into (26), the rate becomes readily available. 
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[0059] Turning to the MIMO case, the adaptive 2D beamformer described herein subsumes a 
ID beamformer by setting 81 = 1 and 82 = 0. Numerical search is now unnecessary, and 
8 2 = 0 does not depend on a anymore. The following can be simplified: 

a, =A,(a,l,M,.) 

r \ 

(31) 



Hi 



0.2 



P 

y bjzrget 1 



The fading region thus depends only on X\ . 

[0060] FIG. 4 is a graphic that plots the optimal regions for different signal constellations 
with ? b =10~\E S /N 0 = \5dB and p = 0.9 . As the constellation size increases, the 
difference between ID and 2D beamforming decreases. 

[0061] With perfect CSI (ct g 2 = 0.H = h) the optimal loading ends up being 5, = 1,<5 2 = 0 . 
Therefore, the optimal transmission strategy in this case is ID eigen-beamforming. The 
results apply to ID beamforming, but with g\ = 0 Specifically, we simplify to 



P 6 (A/,)«0.2exp 
and to 



-A, 5 



N, 



(32) 



0 J 



OLi =2,(^,1,^) = 



1 



-in 



' 0.2 ^ 

P 



(33) 



Eq. (32) reveals that the MIMO antenna gain is introduced solely through X\, the maximum 
eigenvalue of (or, HH H ). 

[0062] Notice that with perfect CSI, one can enhance spectral efficiency by adaptively 
transmitting parallel data streams over as many as N t eigen-channels of. These data streams 
can be decoded separately at the receiver. However, this scheme can not be applied when the 

available CSI is imperfect, since the eigen-directions of HH H are no longer the eigen- 
directions of the true channel HH H . As a result, these parallel streams will be coupled at the 
receiver side, and will interfere with each other. This coupling calls for higher receiver 
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complexity to perform joint detection, and also complicates the transmitter design, since no 
approximate BER expressions are readily available. 

Adaptive Trellis Coded Modulation 

[0063] Next, coded modulation is considered. Recall that each information symbol s(n) is 
equivalently passing through a scalar channel in the proposed transmitter. Thus, 
conventional channel coding can be applied. For exemplary purpose, trellis coded 
modulation (TCM) is focused on, where a fixed trellis code is superimposed on uncoded 
adaptive modulation for fading channels. The single antenna design with perfect CSI can be 
extended to the MIMO system described herein with partial, i.e., imperfect, CSI. 
[0064] For adaptive trellis coded modulation, out of n information bits, k bits pass through a 
trellis encoder to generate k + r coded bits. A constellation of size 2 n+r is partitioned into 2 k+r 
subsets with size 2 n k each. The k + r coded bits specify which subset to be used, and the 
remaining n-k uncoded bits specify one signal point from the subset to be transmitted. The 
trellis code may be fixed, and the signal constellation may be adapted according to channel 
conditions. Different from the uncoded case, the minimum constellation size now is 2 k+r with 
each subset containing only one point. With a constellation of size M h only log2(M,) - r bits 
are transmitted. 

BER approximation for AWGN channels 

[0065] Let df ree denote the minimum Euclidean distance between any pair of valid 
codewords. At high SNR, the error probability resulting from nearest neighbor codewords 
dominates. The dominant error events have probability: 




(34) 




where N(djr ee ) is the number of nearest neighbor codewords with Euclidean distance dfr ee * 
Along with (4) for the uncoded case, the BER can be approximated by: 
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dl 




c 3 exp - 



free 



(35) 




where the constants C2 and c$ need to be determined. For each chosen trellis code, one 
constant C3 may be used for all possible constellations to facilitate the adaptive modulation 
process. 

[0066] For each chosen trellis code and signal constellation M h the ratio of d\ ee I d\ is fixed. 
For each prescribed trellis code, we define: 
d\ ee 

g i = — — g { , for the constellation M i . (36) 
d o 

Substituting (36) and (3) into (35), the approximate BER for constellation A/j- can be obtained 

as: 



The four-state trellis code can be checked with k = r=l. The constellations of size 

M i = 2' , V* g [2,8] are divided into four subsets, following the set partitioning procedure. 

Let dj denote the minimum distance after the /th set partitioning. For QAM constellations, 



With M = 4, no parallel transition exists, and we have d \ ee = d\ + 2d\ = 5d\ . We find the 

parameter c 3 = 1 .5 = 0.375 N{djr ee ) for the four-state trellis, where N(dfr ee ) = 4. 

[0067] FIG. 5 is a graph that plots the simulated BER and the approximate BER in (37). The 

approximation is within 2dB for BER less than 10" 1 . 

[0068] FIG. 6 is a graph that plots the trellis for the eight-state trellis code, which may also 
be checked with k = 2 and r = 1 . The constellations of size M = 2 l , Vie are divided into 
eight subsets. The subset sequences dominate the error performance with 
d lee = d l + sd \ = 5d o for a11 constellations. We choose c 3 = 6 = 0.375^^) for the eight- 
state trellis code, where N{djr ee ) - 16. The approximation is within 2dB for BER less than 10" 



r 




P 6,rcA/(^/)«^exp - 



(37) 



we have d /+l Id } = V2 . When M> 4, parallel transitions dominate with d \ ee -d\- Ad\ . 
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Adaptive TCMfor fading channels 

[0069] The adaptive coded modulation with mean feedback may now be specified. Since the 
transmitted symbols are correlated in time, a time index t is explicitly associated for each 
variable e.g., H(f) is used to denote the channel perceived at time t. The following average 
error probability at time t can be calculated based on (1 1) and (37): 



« c 3 E< exp 



/V 0 J 



(38) 



At each time t when updated feedback arrives, transmitter 4 automatically selects the 
constellation: 

M(/)=argmax K™(M,t)<? b ^ et (39) 

By the similarity of (37) and (5), we end up with an uncoded problem with constellation M, 
having a modified constant g] and conveying log2(M) - r bits. 

[0070] However, distinct from uncoded modulation, the coded transmitted symbols are 
correlated in time. Suppose that the channel feedback is frequent. The subset sequences may 
span multiple feedback updates, and thus different portions of one subset sequence may use 
subsets partitioned from different constellations. The transmitter design in (39) implicitly 
assumes that all dominating error events are confined within one feedback interval. 
Nevertheless, this design guarantees the target BER for all possible scenarios. Since the 
dominating error events may occur between parallel transitions, or between subset sequences, 
this disclosure explores all of the possibilities: 

1) Parallel transitions dominate: The parallel transitions occur in one symbol interval, 
and thus depend only on one constellation selection. The transmitter adaptation in (39) is in 
effect. 

2) Subset sequences dominate: The dominating error events may be limited to one 
feedback interval, or, may span multiple feedback intervals. If the dominating error events 
are within one feedback interval, the transmitter adaptation in (39) is certainly effective. On 
the other hand, the error path may span multiple feedback intervals, with different portions of 
the error path using subsets partitioned from different constellations. 
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[0071] We focus on any pair of subset sequences Ci and C2. For brevity, it is assumed that 
the error path spans two feedback intervals (or updates), at time t\ and fa Different 
constellations are chosen at time t\ and /2 5 resulting in different d\ (t x ) and d\ (t 2 ) As 
illustrated in FIG. 6, the distance between ci and C2 can be partitioned as: 
d 2 ( c i 9 c 2 \*\ 9*2 ) = d 2 (t\ ) + d 2 (t 2 ) The contribution of d 2 (t x ) at time t\ is the minimum distance 
between subsets £ 0 (f, ) and £ 2 (t x ) plus the minimum distance between subsets £ 0 (f, )and 
M x )M,d 2 (f t ) = d x \t x ) + d 2 0 (t x ) = 3d 2 0 (t x ). Similarly, we have d 2 (t 2 ) = d x 2 (t 2 ) = 2d 2 (t 2 ) 

[0072] Now, two virtual events can be constructed that the error path between ci and C2 
experiences only on feedback: One at t\ and the other at fa For / = 1,2, the average pairwise 
error probability is defined as: 





f { 


P(c, ^ c. 


f,.) = 0.5Ejexp 



eqv , 



(40) 



Next, the following constants are defined: 



b 2 .= 



(41) 



d\c„c 2 \tj 2 " d\c„c 2 \t 2 ) 
It is clear that b\ + bi = 1 , and 0 <b\, bz<\. 

[0073] When the error path between cl and c2 spans multiple feedback intervals, the average 
PEP decreases relative to the case of one feedback interval. Since the conditional channels at 
different times are independent, 



E{p(c, - 
= 0.5E 



c 2 |r, 



exp 



»' 2 )} 



4N n 



(42) 



xE 



<0.5 



exp 



f -h 2 eqv (t 2 )d\t 2 )" 



AN, 



P(c, ->c 2 |/,) 



o 



0.5 



P(c, -»c 2 r 2 ) 



< max(P(c, -» c 2 /,),P(c, 



0.5 



c 2 \t 2 



)) 
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where in deriving (42), the inequality in (47) (proved below) is used. Eq. (42) reveals that 
the worst case happens when the error path between subset sequences spans only on 
feedback. In such cases, however, we have guaranteed the average BER in (39), for each of 
the feedback intervals, the average pairwise error probability decreases, and thus the average 
BER (proportional to the dominating pairwise error probability is approximated in (35)) is 
guaranteed to stay below the target. 

[0074] In summary, the transmitter adaptation in (39) guarantees the prescribed BER. With 
perfect CSI, this adaptation reduces to a point where do is maintained for each constellation 
choice. The techniques described herein are simpler in comparison to some conventional 
approaches in the sense that the described techniques do not need to check all distances 
between each pair of subsets. 
Examples 

[0075] In simulation purposes, the channel setup is adopted with a\ = 1 . Recall that the 
feedback quality a] is related to the correlation coefficient J 0 (27rf d r) via a] = 1- | p | 2 . . 
With p = 0.95,0.9,0.8, we have a 2 e = -10. 1,-7.2,-4,4^/2? . For fair comparison among 

different setups, the average received SNR is used in all plots and defined as: 
averageSNR:=(l-J> o JE s /N 0 (43) 

[0076] FIG. 7 plots the rate achieved by the adaptive transmitter 4 with Pb.target =10" 3 , N t = 2, 
N r =\ 9 and p = 1, 0.95, 0.9, 0.8, 0. As illustrated in FIG. 7, it is clear that the rate decreases 
relatively fast as the feedback quality drops. 

[0077] For comparison, FIG. 7 also plots the channel capacity with mean feedback, using the 
semi-analytical result. As shown in FIG. 7, the capacity is less sensitive to channel 
imperfections. The capacity with perfect CSI is larger than the capacity with no CSI by 
about log2(N t ) = 1 bit at high SNR, as predicted. With p = 0.9, the adaptive uncoded 
modulation is about 1 ldB away from capacity. 

[0078] FIG. 8 is a plot that illustrates the achieved transmission rate with Nr = 1, Pb.target = 
10" 3 , and p = 0.9. As shown in FIG. 8, the achieved transmission rate increases as the 
number of transmit antennas increases. The largest rate improvement occurs when TV, 
increases from one to two. 
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[0079] FIG. 9 is a plot that illustrates the tradeoff between feedback delay and hardware 
complexity. As illustrated, one tradeoff value is fdT= 0.01 for single antenna transmissions. 
FIG. 9, verifies that with two transmit antennas, the achieved rate with fdT= 0.1 ( p = 0.904) 

coincides with that corresponding to one transmit antenna with perfect CSI (JdT <0.01); 
hence, more than ten times of feedback delay can be tolerated. The rate with N t = 4 and fdT 
= 0.16 (p = 0.76) is even better than that of N ( = 1 with perfect CSI. To achieve the same rate, 
the delay constraint with single antenna can be relaxed considerably by using more transit 
antennas, an interesting tradeoff between feedback quality and hardware complexity. FIG. 9 
also reveals that the adaptive deign becomes less sensitive to CSI imperfections, when the 
number of transmit antenna increases. 

[0080] FIG. 10 is a plot that illustrates the achieved rate improvement with trellis coded 
modulation. In this example, the four-state and eight-state trellis codes described above were 
tested. First Pb.target was set tolO" 6 , N t = 2, N r =1 . When the feedback quality is near perfect (p 
= 0.99), the rate is considerably increased by using trellis coded modulation instead of 
uncoded modulation, in agreement with the prefect CSI case. However, the achieved SNR 
gain decreases quickly as the feedback quality drops, as shown in FIG. 1 0. This can be 
predicted, since increasing the Euclidean distance by TCM with set partitioning is less 
effective for fading channels ( p < 1) than for AWGN channels ( p =1). If affordable, 

coded bits can be interleaved to benefit from time diversity, as suggested. This is suitable for 

the 8-state TCM, where the subset sequences dominate the error performance. 

[0081] On the other hand, the Euclidena distance becomes the appropriate performance 

measure, when the number of receive antennas increases, as established. The SNR gain 

introduced by TCM is thus restored, as shown in Fig. 1 1 with N r = 2, 4. 

[0082] Comparing FIG. 10 with FIG. 7, one can observe that the adaptive system is more 

sensitive to noisy feedback when the prescribed bit error rate is small (10" 6 ) as opposed to 

large (10 -3 ). 

[0083] In accordance with these techniques, adaptive modulation for multi-antenna 
transmissions with channel mean feedback can be achieved. Based on a two dimensional 
beamformer, the proposed transmitter optimally adapts the basis beams, the power allocation 
between two beams, and the signal constellation, to maximize the transmission rate while 
guaranteeing a target BER. Both uncoded and trellis coded modulation have been addressed. 
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Numerical results demonstrated the rate improvement enabled by adaptive multi-antenna 
modulation, and pointed out an interesting tradeoff between feedback quality and hardware 
complexity. The proposed adaptive modulation maintains low receiver complexity thanks to 
the Alamouti structure. 

Adaptive Orthogonal Frequency Division (OFDM) Multiplexed Transmissions 
[0084] The techniques described above for adaptive modulation over MIMO flat-fading 
channels are hereinafter extended to adaptive MIMO-OFDM transmissions over frequency- 
selective fading channels based on partial CSI. As further described below, an OFDM 
transmitter applies the adaptive two-dimensional space-time coder-beamformer on each 
OFDM subcarrier, with the power and bits adaptively loaded across subcarriers, to maximize 
transmission rate under performance and power constraints. 

[0085] This problem is challenging because information bits and power should be optimally 
allocated over space and frequency, but its solution is equally rewarding because high- 
performance high-rate transmissions can be enabled over MIMO frequency-selective 
channels. As further described, the techniques include: 

• Quantification of partial CSI for frequency selective MIMO channels, and 
formulation of a constrained optimization problem with the goal of maximizing rate 
for a given power budget, and a prescribed BER performance. 

• Design of an optimal MIMO-OFDM transmitter as a concatenation of an adaptive 
modulator, and an adaptive two-dimensional coder-beamformer. 

• Identification of a suitable threshold metric that encapsulates the allowable power and 
bit combinations, and enables joint optimization of the adaptive modulator- 
beamformer. 

• Incorporation of algorithms for joint power and bit loading across MIMO-OFDM 
subcarriers, based on partial CSI. 

• Illustration of the tradeoffs emerging among rate, complexity, and the reliability of 
partial CSI, using simulated examples. 

[0086] FIG. 12 is a block diagram of a wireless communication system 30 in which an 
adaptive MIMO-OFDM transmitter 32 applies adaptive two-dimensional coder-beamformers 
34A-34N across each OFDM subcarrier, along with an adaptive power and bit loading 
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scheme. In particular, FIG. 12 depicts an equivalent discrete-time baseband model of an 
OFDM wireless communication system 30 equipped with K subcarriers, N t transmit-, and N r 
receive-antennas, signaling over a MIMO frequency selective fading channel. Per OFDM 
sub-carrier, transmitter 32 deploys one of adaptive two-dimensional (2D) coder-beamformers 
34A-34N. Each of 2D coder-beamers 34 combines Alamouti's space time block coding 
(STBS) with transmit beamforming. Higher-dimensional coder-beamformers based on 
orthogonal STBS with N t > 2, can be also applied, as detailed below. However, the 2D 
coder-beamformers 34 strike desirable performance-rate-complexity tradeoffs, and for this 
reason, the 2D case is illustrated for exemplary purposes. 

[0087] To apply the 2D coder-beamformer per subcarrier, two consecutive OFDM symbols 
are paired to form on space-time coded OFDM block. Due to frequency selectivity, different 
subcarriers experience generally different channel attenuation. Hence, in addition to 
adapting the 2D coder-beamformer on each subcarrier, the total transmit-power may also be 
judiciously allocated to different subcarriers based on the available CSI at transmitter 32. 
[0088] Let n be used to index space time coded OFDM blocks (pairs of OFDM symbols), 
and let k denote the subcarrier index; i.e., k e {0,1,..., K - 1} . Let P[n;k] stand for the power 
allocated to the kth subcarrier of the nth block. Then, depending on P[n;k], a constellation 
(alphabet) /4[rc;A:] consisting of M[n;k] constellation points is selected. In addition to square 
QAMs with M[n;k] = 2 2 \ that have been used extensively in adaptive modulation, rectangular 
QAMs with M[n;k] = 2 2/+1 are also considered. Similar to the previous analysis, the 
subsequent analysis focuses on rectangular QAMs that can be implemented with two 

independent PAMs: one for the In-phase branch with size ^jlM\n\k\ and the other for the 

Quadrature-phase branch with size ^M\n:k\l2 as those studied. Due to the independence 

between I-Q branches, this type of rectangular QAM incurs modulation and demodulation 
complexity similar to square QAM. 

[0089] For each block time-slot n, the input to each of 2D coder-beamformer 34 used per 
subcarrier entails two information symbols, sj[n;k] and S2[n;k], drawn from A ^ n;k \ with each 
one conveying 
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b[n\k] = log 2 (M[n;kf) (44) 
bits of information. These two information symbols will be space-time coded, power-loaded, 
and multiplexed by the 2D beamformer to generate an AT, x 2 space-time (ST) matrix as: 



X[n; k] = [ul [n; k\u 2 [n; fc]] * 

where S[n;k] is the well-known Alamouti ST code matrix; U[n;A:] is the multiplexing matrix 
formed by two N t x 1 basis-beam vectors ui[n;&] and U2[n;k]; and D[n;k] is the 
corresponding power allocation matrix on these two basis-beams with 
0 < <5j [n; k\ 8 2 [n; k] < 1 , and <5j [n; k] + 8 2 [n; k] = l. In the two time slots corresponding to the 
two OFDM symbols involved in the nth ST coded block, the two columns of X[n;k] are 
transmitted on the kth subcarrier over N t transmit-antennas. 

[0090] For purposes of illustration, it is assumed that the MIMO channel is invariant during 
each space-time coded block, but is allowed to vary form block to block. Let 

Vv W := K,v h^],..., 1%^ v [n; L]] r be the baseband equivalent FIR channel between the ^ith 

transmit- and the vth receive-antenna during the nth block, where 1 < |i< N h 1 < v < N n and L 

is the maximum channel order of all N t N r channels. With f k := [l, e /27I * / " e ^ 2M/N J the 

frequency response of h^[n] on the kth subcarrier is: 

"„vM = thJn;iy*>*" N = ffhjn] (46) 

/=0 

[0091] Let H[/i;A:] be the N t x N r matrix having H ^\n\ k] as its (n, v)th entry. To isolate the 

transmitter design from channel estimation issues at the receiver, we suppose that the 
receiver has perfect knowledge of the channel //[«; k\ V/2, k . 

[0092] With Y[n;k] denoting the nth received block on the kth subcarrier, we can express the 
input-output relationship per subcarrier and ST coded OFDM block as 
Y[n; k]=H T [n\ k]x[n; k] + W[n; k] 

= H T [n;kp* [n; k]o[n\ k]s[n; k] + W[n\ k] (47) 



>!M o 
0 ^S 2 [n;k\ 





Si[n;k] 


-s' 2 [n;k] 




s 2 [n;k] 


sl[n;k] 



, (45) 
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where W[n;k] stands for the additive white Gaussian noise (AWGN) at the receiver with 
each entry having variance No/2 per real and imaginary dimension. Based on (47), one can 
view our coded-beamformed MIMO OFDM transmissions per subcarrier as an Alamouti 
transmission with ST matrix S[n;k] passing through an equivalent channel matrix B [n;k] := 
H [n;k] U [n;k] D[n;k], With knowledge of this equivalent channel and maximum ratio 
combining (MRC) at receiver 38, it can be verified that each information symbol is thus 
passing through an equivalent scalar channel with I/O relationship 

h fc*]= Kgv [n\ [n\k] + w, [n; k\ i = 1,2, (48) 
where the equivalent channel is: 

M*;*] = !*MI, W^I^t^MI* J" (49) 

Partial CSI for Frequency-Selective MIMO Channels 

[0093] Mean feedback has been described above in reference to flat-fading multi-antenna 
channels to account for channel uncertainty at the transmitter, where the fading channels are 
modeled as Gaussian random variables with non-zero mean and white covariance. This 
mean feedback model is adopted for each OFDM subcarrier of the OFDM system 30 of FIG. 
12. Specifically, it is assumed that on each subcarrier k, transmitter 32 obtains an un-biased 

channel estimate H[n; k] either through a feedback channel, or during a duplex mode 
operation, or, by predicting the channel from past blocks. Transmitter 32 treats this "nominal 
channel" H[h;A:] as deterministic, and in order to account for CSI uncertainty, it adds a 

"perturbation" term. The partial CSI of the true N t x N r MIMO channel H[n;k] at transmitter 
32 is thus perceived as: 

H[n; k] = H[n\ k] +z[n\k\k = 0,1,..., K - 1, (50) 
where E[n;k] is a random matrix Gaussian distributed according to 

CN(p NxNr N r o] \n\k\f Nt ) The variance g] \n\ k] encapsulates the CSI reliability on the kth 
subcarrier. 

[0094] Suppose that the FIR channel taps have been acquired perfectly at the receiver, and 
are fed back to the transmitter with a certain delay, but without errors thanks to powerful 
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error control codes used in the feedback. Let us also assume that the following conditions 
hold true: 

i) The L + 1 taps {h^ [«;/]} f =0 in [n] are uncorrected, but not necessarily 
identically distributed (to account for e.g., exponentially decaying power profiles). Each tap 
is zero-mean Gaussian with variance cr^[/] Hence, 

K M~ C M<>>E J^ere^ „ ■= diag{a]M.WM 

ii) The FIR channels {^^["]}^u=i between different transmit- and receive-antenna 

pairs are independent. This requires antennas to be spaced sufficiently far apart from each 
other. 

iii) All FIR channels have the same total energy on the average 

a\ = tr(^ MV }, Vji, v. This is reasonable in practice, since the multi-antenna transmissions 

experience the same scattering environment. 

iv) All channel taps are time varying according to Jakes' model with Doppler 
frequency fj. 

[0095] At the nth block, assume the channel feedback {h^ [n^^; =] , that corresponds to the 
true channels N& blocks earlier is obtained; i.e. [n] = [n - N b } Assume each space time 
coded block has time duration T b seconds. Then, h£ v [n] is drawn from the same Gaussian 
distribution as h^ v [n] 9 but AfcTi seconds ahead. Let p := J 0 (27tf d N b T b ) denote the 
correlation coefficient specified by Jakes' model, where Jo( ) is the zeroth order Bessel 
function of the first kind. The MMSE predictor of h^ v [n], and i), is h^ v [n] = p h [n] . To 
account for the prediction imperfections, the transmitter forms an estimate h [n] as: 

where ^ v [n] is the prediction error. Under i), it can be verified that 

U*]~av(o,(i-| P | 2 )£ J. (52) 

[0096] The mean feedback model on channel taps described above can be translated to the 
CSI on the channel frequency response per subcarrier. Based on this, the matrices with 
(^v)th entries can be obtained: 
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[#[»;*IL = f"KvM[ H k k iv = /^v,«n4s[n;%, = /^[iij Using i), ii), and 
(52), it can be verified that E[n;k] has covariance matrix AT..(l - |p| 2 . Notice that in 

this case, the uncertainty indicators cr £ 2 [n; k] = fl - |p| 2 ]br^ are common to all subcarriers. 

[0097] Notwithstanding, the partial CSI has also unifying value. When K = 1, it boils down 
to the partial CSI for flat fading channels. With a \ = 0 , it reduces to the perfect CSI of the 
MIMO setup considered. When N t = N r = 1, it simplifies to the partial CSI feedback used for 
SISO FIR channels. Furthermore, with N t = N r = 1 and a] = 0 it is analogous to perfect CSI 
feedback for wireline DMT channels. 

[0098] One objective is to optimize the MIMO-OFDM transmissions in FIG. 12, based on 
partial CSI available at the transmitter. Specifically, we may want to maximize the 
transmission rate subject to a power constraint, while maintaining a target BER performance 

on each subcarrier. Let BER\n\ k] denote the perceived average BER at the transmitter on 

the kth subcarrier of the nth block, and BERo [k] stand for the prescribed target BER on the 
Ath subcarrier. The target BERs can be identical, or, different across subcarriers, depending 
on system specifications. Recall that each space-time coded block conveys two symbols, 
s x [n; k\s 2 \n\k\ and thus 2b[n;k] bits of information on the Ath subcarrier. One goal is thus 
formulated as the following constrained optimization problem: 

maximize 2^6[/2;&] 

*=o 

cl BER[n;k]= BERo[k\Vk 

K-\ 

subject to c2 £ p [ n > k \ = P u*a and P V\ k \ * °> v * ( 53 ) 

k=0 

c3 Z?[/2;A:]e{0,l,2,3,4,5,6,...}, 

where P tota i is the total power available to the transmitter per block. 

[0099] The constrained optimization in (10) calls for joint adaptation of the following 

parameter: 

• power and bit loadings {P\n\ k\b\n\kty*^ across sub-carriers; 

• basis-beams per subcarrier {u x \n\ k\u 2 [n; A]}^ 1 
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• power splitting between the two basis-beams per subcarrier {8 ] [n; k\ 8 2 [n; k]*J . 

[0100] Compared with the constant-power transmissions over flat-fading MIMO channels, 
the problem here is more challenging, due to the needed power loading across OFDM 
subcarriers, which in turn depends on the 2D beamformer optimization per subcarrier. 
Intuitively speaking, our problem amounts to loading power and bits optimally across space 
and frequency, based on partial CSI. 

Adaptive MIMO-OFDM With 2D Beamforminz 

[0101] For notational brevity, we drop the block index n, since our transmitter optimization 
is going to be performed on a per block basis. Our transmitter includes an inner stage 
(adaptive beamforming) and an outer stage (adaptive modulation). Instrumental to both 
stages is a threshold metric, d\ [k] 9 which determines allowable combinations of (P[k],b[k]), 

so that the prescribed BERo [k] is guaranteed. 

[0102] Next, the basis beams u x [k\ u 2 [k], and the corresponding percentages 8 X [k\8 2 [k] of 
the power P[k] are determined for a fixed (but allowable) combination of (P[k], b[k]). Let Ts 
be the OFDM symbol duration with the cyclic prefix removed, and without loss of generality, 
let us set Ts == 1 . With this normalization, the constellation chosen for the kth subcarrier has 
average energy s s [k] = P[k]T s = P[k\ and contains M[k] = 2 b[k] signaling points. If [k] 
denotes the minimum square Euclidean distance for this constellation, we will find it 
convenient to work with the scaled distance metric d 2 [k] := d^ in [k]/ 4 , because for QAM 
constellations, it holds that, 

dL [k] = 4d 2 [k] = 4g{b[k])e s [k] = 4g(*[*M4 (54) 
where the constant g(b) depends on whether the chosen constellation is rectangular, or, 
square QAM: 

-f-,6=l,3,5,... 

g(*):=p;- 46 (55) 

— 7 — ,6=2,4,6,... 
4-2* -4 

[0103] Notice, that d*[k] summarizes the power and constellation (bit) loading information 
that the adaptive modulator passes on to the coder-beamformer. The later relies on cf[k\ and 
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the partial CSI to adapt its design so as to meet constraint CI . To proceed with the adaptive 
beamformer design, we therefore need to analyze the BER performance of the scalar 
equivalent channel per subcarrier, with input Si[k] and output z/[fc], as described by (48). For 
each (deterministic) realization of h eqv [k], the BER when detecting Si[k] in the presence of 
AWGN in (5), can be approximated as 



where the validity of the approximation has also been confirmed. Based on our partial CSI 
model, the transmitter perceives h eqv [k]as a random variable, and evaluates the average BER 
performance on the kth subcarrier as: 



[0104] We will adapt our basis beams u\[k], U2[k] to minimize for a given <?[k], 

based on partial CSI. To this end, we consider the eigen decomposition on the "nominal 
channel" per subcarrier (here the kth) 




(56) 




(57) 



H[k]ff" [k] = U H [fc]A„ [kp" [k\ with 
AM=diag(x i [k\...A Nl l4 



(58) 



where w H [k] is unitary, and A H [k] contains on its diagonal the eigenvalues in a non- 
increasing order: A, [k] > ... > A Nt [k] > 0 . As proved, the optimal u\[k] and U2[&] minimizing 



the BER[k] are: 



«, [k] = u h,\ [k\ u 2 [k] = u 




(59) 
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Notice that the columns of U H [k] are also the eigenvectors of the channel correlation matrix 

E{H[k]H H [k} = J7" [k]+ N r a] [k]l Nt , that is perceived by the transmitter based on 

partial CSI. Hence, the basis beams u\[k] and U2[k] adapt to the two eigenvectors of the 
perceived channel correlation matrix, corresponding to the two largest eigenvalues. 
[0105] Having obtained the optimal basis beams, to complete our beamformer design, we 
have to decide how to split the power P[k] between these two basis beams. 
[0106] With the optimal basis beams, the equivalent scalar channel is: 

*i -*i|#"[*S«[*f +8 2 [kp"[kj tli ,2[kf. (60) 

For i = 1,2, the vector H H [fcf///,, [k]in (17) is Gaussian distributed with 

CN{H H [kJi Hj[k\a*[k][ N } Furthermore, we have that For an 

arbitrary vector a ~ C/v(ju,^ \ the following identity holds true. 

£{exp(-a"4 = exp(- A1 "(/ + X ) _ V)/det(/ + X ) (61) 
Substituting (60) into (57), and applying (61), we obtain: 



BER[k]* 0.2[[ 



,l + *J* 2 [*. 2 W/^oJ 



exp 



^ i+5j*]rf 2 [*K 2 W/^ 0> , 



(62) 



[0107] Eq. (62) shows that the power splitting percentages 5, [k\S 2 [k], depend on 

A, [k\ A 2 [k], and e?[k]. Their optimum values can be found by minimizing (62) to obtain: 

<5, [k] = min(s , [k]l\ 8 2 [k] = max(5 2 [k]o] (63) 
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where, with [k] := A„ [k]l{N,o) [k]) and m„ [k] := (l + K„ [kf /(l + 2K„ [k% p = 1,2, we 
have 



5 fci- m M , ™»M 



^ m,[k] 



,A*=tf. (64) 



[0108] The solution guarantees that 0 < 8 2 [k] < <5, [k] < 1 , and 5, 8 2 [k] = l. Based on the 
partial CSI (h^Jo^ [k§, eqns. (16) and (20) provide the 2D coder-beamformer design with 
the minimum 52s7?[fc], that is adapted to a given cr[k] output of the adaptive modulator. 



Because this minimum depends on d*[k], the natural question at this point is: for 

which values of d*[k], call it d\ [k], will the minimum reach the target BERo[k]? 



[0109] We next establish that BER[k] in (62), with {<5 f {k}} 2 =l specified in (63), is a 
monotonically decreasing function of c?[k\. 



[0110] Lemma: Given partial CSI, the 5£7?[A:] in (62) is a monotonically decreasing 
function of ^[k]. Hence, there exists a threshold d\ [fc]for which 5£7?[A:] < BERo [k] if and 
only if d 2 [k] > d 2 [k] . The threshold d 2 [k] is found by solving (19) with respect to ef[k], 



when BER[k]< BER 0 [k]. 



[0111] Proof. A detailed proof requires the derivative of with respect to d 2 [k], over 

two possible scenarios: 8 2 [k] = 0 , and 5 2 [k] > 0 , as indicated by (63). We have verified 
that this derivative is always less than zero for any given d 2 ^]. However, we will skip the 
lengthy derivation, and provide an intuitive justification instead. Suppose that 5, [k] and 

8 2 [k] are optimized as in (20) for a given Now, let us increase ^[k] by an amount 
A d . Even when <5, [k] and 5 2 [k] are fixed to previously optimized values (i.e, even if the 2D 
coder-beamformer is non-adaptive) the corresponding BER decreases, since signaling with 
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larger minimum distance always leads to better performance. With the minimum 
constellation distance </ 2 [fc]+ A d , optimizing 5 ] [k] and <5 2 [fc]will further decrease the BER. 

Hence, increasing cf[k] decreases ] monotonically. 

[0112] This lemma implies that we can obtain the desirable d*[k]. However, since no closed- 
form solution appears possible, we have to rely on a one-dimensipnal numerical search. 
[0113] To avoid the numerical search, we next propose a simple, albeit approximate, 
solution for d\ [k]. Notice that eq. (62) is nothing but the average BER of an 2AT r -branch 
diversity combining system, with N r branches undergoing Rician fading with Rician factor 
KXk] = X\k]l{N^[k% while the other N r branches are experiencing Rician fading with 
Rician factor K 2 [k] = A 2 [^/(a^ct* [&]). Approximating a Rician distribution by a 
Nakagami-m distribution, we can approximate the BER[k] by: 



3 f=i^ 



(i + ^M*KM)) 



(65) 



where w M is defined after eq. (63). It can be easily verified that BER [k] is also 

monotonically decreasing as cf[k] increases. Setting BER [k] = BERo [k] , we can solve for 
d\ [k] using the following two-step approach: 

Step 1: Suppose that d\ [k] can be found with S 2 [k] > 0 . Substituting (64) into (65), 
we obtain: 



di[k]= 



\k\\5BER Q [k§ 



r i/K[*K) 



~B 0 [k] 



'IkY 



(66) 



where 



(67) 
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To verify the validity of the solution, let us substitute d] [A:]into (21). If 8 2 [k] > 0 is 
satisfied, then (66) yields the desired solution. Otherwise, we go to step 2. 

Step 2: When Step 1 fails to find the desired d\ [k] with 8 2 [k] > 0 , we set 8 2 [k] = 0 
Substituting 8 X [k] - 1 and 8 2 [k] = 0 , we have 



01 J (l + AT,[*])/i»,[i] o'M 



[0114] This approximate solution of d\ [k] avoids numerical search, thus reducing the 
transmitter complexity. 

[0115] We next detail some important special cases. 

[0116] Special Case 1 -MIMO OFDM with one-dimensional (ID) beamforming based on 
partial CSL The ID beamforming is subsumed by the 2D beamforming if one fixes a priori 
the power percentages to 5, [k] = 1 , and 8 2 [k] = 0 . In this case, d\ [k] can be found in closed- 
form. 

[0117] Special Case 2 - SISO-OFDM based on partial CSL The single-antenna OFDM 
based on partial CSI can be obtained by setting N t = N r =l. In this case, A, [k] = |h[& j , 

where U[k] is the "nominal channel" on the kth subcarrier. Hence, this yields d\ [£]in this 

case too, after setting N r =\ 9 and K x := H[fcj la\ [k]. 



[0118] Special Case 3 - MIMO-OFDM based on perfect CSL With a\ [k = 0] the adaptive 

beamformer on each OFDM subcarrier reduces the ID beamformer with 8 2 [k] = 0 . This 
corresponds to the MIMO-OFDM system, when cochannel interference (CCI) is absent. In 
this special case, no Nakagami approximation is need, and the BER performance simplifies 
to 

BER[k] = 0.2 exp(- d 2 [kfo [k]l N 0 ), (69) 
which leads to a simpler calculation of the threshold metrics as 
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d\ [k] = [in(5BERo [kjjjp Q /*,[*]. (70) 

[0119] Special Case 4 - Wireline DMT systems: The conventional wireline channel in DMT 
systems, can be incorporated in our partial CSI model by setting TV, = 1 , N r = 1 , and 

a\ [k] = 0 . In this case, the threshold metric d\ [k] is given by (70) with A, [k] = |#[£] 2 . 
Adaptive Modulation based on Partial CSI 

[0120] With d\ [k] encapsulating the allowable (P[k] 9 b[k]) pairs per subcarrier, we are ready 

to pursue joint power and bit loading across OFDM subcarriers to maximize the data rate. It 
turns out that after suitable interpretations, many existing power and bit loading algorithms 
developed for DMT systems, can be applied to the adaptive MIMO-OFDM system based on 
partial CSI. We first show how the classical Hughes-Hartogs algorithm (HHA) can be 
utilized to obtain the optimal power and bit loadings. 

1) Optimal Power and Bit Loading: As the loaded bits assume finite (non-negative 
integer) values, a globally optimal power and bit allocation exists. Given any allocation of 
bits on all subcarriers, we can construct it in a step by step bit loading manner, with each step 
adding a single bit on a certain subcarrier, and incurring a cost quantified by the additional 
power needed to maintain the target BER performance. This hints towards the idea behind 
the Hughes Hartogs algorithm (HHA): at each step, it tries to find which subcarrier supports 
one additional bit with the least required additional power. Notice that the HHA belongs to 
the class of greedy algorithms that have found many applications such as the minimum 
spanning tree, and Huffman encoding. 

[0121] The minimum required power to maintain i bits in the &th sub carrier with threshold 
metric d\ [k] is d\ [k]/ g(i) . Therefore, the power cost incurred when loading the ith bit to 
the kth subcarrier is 

cM = f#L^L, m , V *. (71) 
V ' g(i) g0-l) 

[0122] For/= 1, wesetg(j-l) = oo, and thus c(k,l)= d%[k]/g(l). In the following 

algorithm, we will use P rem to record the remaining power after each bit loading step, b c [k] to 
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store the number of bits already loaded on the kth subcarrier, and P c [k] to denote the amount 
of power currently loaded on the £th subcarrier. Now we are ready to describe the greedy 
algorithm for joint power and bit loading of the adaptive MIMO-OFDM based on partial 
CSI. 

The Greedy Algorithm: 

1) Initialization: SetP rem =Ptotai. For each subcarrier, set b c [k] = P c [k] = 0 and 
compute dl\k\ 

2) Choose the subcarrier that requires the least power to load one additional bit; i.e., 

select 

*o=*smin<<*.*cM+0 < 72 ) 

* 

3) If the remaining power cannot accommodate it, i.e., if P rem < c(ko,b c [ko]+l), then 
exit with P[k] = P c [k], and b[k] = b c [k]. Otherwise, load one bit to subcarrier ko, and update 
state variables as 



P«=P«-*.*c[*o+l]). (73) 

Pc[*o] = P«[*o] + c(*b.*c[*o]+l). (74) 

*efc] = U*o]+l- (75) 

4) Loop back to step 2. 
[0123] The greedy algorithm yields a "1-bit optimal" solution, since it offers the optimal 
strategy at each step when only a single bit is considered. In general, the 1-bit optimal 
solution obtained by a greedy algorithm may not be overall optimal. However, for our 
problem at hand, we establish in Appendix I the following: 

[0124] Proposition 1: The power and bit loading solution {P[fcJ&[fc]}£|J that the greed 
algorithm converges to, in a finite number of steps, is overall optimal. 
[0125] Notice that the optimal bit loading solution may not be unique. This happens when 
two or more subcarriers have identical d\ [k] under their respective (and possibly different) 

performance requirements. However, a unique solution can be always obtained, after 
establishing simple rules to break possible ties that may arise. 
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[0126] Allowing for both rectangular and square QAM constellations, the greedy algorithm 
loads one bit at a time. However, only square QAMs are used in may adaptive systems. If 
only square QAMs are selected during the adaptive modulation stage, we can then load two 
bits in each step of the greedy algorithm, and thereby halve the total number of iterations. It 
is natural to wonder whether restricting the class to square QAMs has a major impact on 
performance. Fortunately, as the following proposition establishes, limiting ourselves to 
square QAMs only incurs marginal loss: 

[0127] Proposition 2: Relative to allowing for both rectangular and square QAMs incurs up 
to one bit loss (on the average) per transmitted space-time coded block, that contains two 
OFDM symbols. 

[0128] Compared to the total number of bits conveyed by two OFDM symbols, the one bit 
loss is negligible when using only square QAM constellations. However, reducing the 
number of possible constellations by 50% simplifies the practical adaptive transmitter design. 
These considerations advocate only square QAM constellations for adaptive MIMO-OFDM 
modulation (this excludes also the popular BPSK choice). 

[0129] The reason behind Proposition 2 is that square QAMs are more power efficient than 
rectangular QAMs. With K subcarriers at our disposal, it is always possible to avoid usage of 
less efficient rectangular QAMs, and save the remaining power for other subcarriers to use 
power-efficient square QAMs. Interestingly, this is different from the adaptive modulation 
over flat fading channels, where the transmit power is constant and considerable loss (on bit 
every two symbols on average) is involved, if only square QAM constellations are adopted. 

2) Practical Considerations: The complexity of the optimal greedy algorithm is on 
the order of 0(N bits K) , where Afc j£s is the total number of bits loaded, and K is the number of 

subcarriers. And it is considerable when AT Wtt and K are large. Alternative low-complexity 
power and bit loading algorithms have been developed for DMT application. Notice that [4] 
and [19] study a dual problem: optimal allocation of power and bits to minimize the total 
transmission power with a target number of bits. Interestingly, the truncated water- filling 
solution can be modified and used in our transmitter design, while the fast algorithm can not, 
since it requires knowledge of the total number of bits to start with. In spite of low- 
complexity, the algorithm is suboptimal, and may result in a considerable rate loss due to the 
truncation operation. 
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[0130] The overall adaptation procedure for the adaptive MIMO-OFDM design based on 
partial CSI can be summarized as follows: 

1) Basis beams per subcarrier {u x [k\ u 2 [k]j*~Q are adapted first using (59), to obtain 
an adaptive 2D coder beamformer for each subcarrier. 

2) Power and bit loading {&[H^]}*=o is then jointly performed across all 
subcarriers, using the algorithm in [15] that offers optimality at complexity lower 
than the greedy algorithm. 

3) Finally, power splitting between the two basis beams on each subcarrier 

[k] S 2 [kY k=l is decided using (63). 



Examples 

[0131] We set K = 64, L = 5, and assume that the channel taps are i.i.d. with covariance 
matrix ^ pv = — - — I L=1 We allow for both rectangular and square QAM constellations in 

the adaptive modulations stage. Let the average transmit-SNR (signal to noise ration) across 
subcarriers is defined as: SNR = P t otaiTJ(KNo). The transmission rate (the loaded number of 
bits) is counted every two OFDM symbols as: ^ £jj 2b\k\. 
Comparison between exact and approximate solution 

[0132] Typical MIMO multipath channels were simulated with N t =4 9 N r = 2, and = 1. 
For a certain channel realization, assuming 2D beamforming on each subcarrier, FIG. 13 
plots the thresholds d\ [k] obtained via numerical search, and from the closed-form solution 

based on eq. (65), with p = 0.5, 0.8, 0.9 and a target BER = 10" 3 . FIG. 14 is the counterpart 
of FIG. 13, but with target BER = 10" 4 . The non-negative eigenvalues X x [k] and A 2 [k]of the 
nominal channels are also plotted in dash-dotted lines for illustration purpose. Observe that 
the solutions of d\ [k] obtained via these two different approaches are generally very close to 
each other. And the discrepancy decreases as the feedback quality p increases, or, as the 

target BERo increases. Notice that the suboptimal closed-form solution in practice, some 
SNR margins may be needed to ensure the target BER performance. Nevertheless, the 
suboptimal closed-form solution for d\ [k] will be used in the ensuing numerical results. 
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[0133] FIGS. 13 and 14 also reveal that on subchannels with large eigenvalues (indicating 
"good quality"), the resulting d\ [k] is small; hence, large size constellations can be afforded 
on those subchannels. 

Power and bit loading with the Greedy Algorithm 

[0134] We set N t = 4,N r = 2, p = 0.5, SNR = 9 dB, and ~BERo = 1 0" 4 For a certain channel 
realization, we plot the power and bit loading solutions obtained via the greedy algorithm in 
FIGS. 15 and 16, respectively. For illustration purpose, we also plot the threshold metrics 
dl\k\. We observe that whenever there is a change in the bit loading solution in FIG. 1 6 

from one subcarrier to the next, there will be an abrupt change in the corresponding power 
loading in FIG. 15. Furthermore, for those subcarriers with the same number of bits, the 
power loaded by the greedy algorithm is proportional to the threshold metric. Also, from the 
bit loading of the greedy algorithm in FIG. 16, we see that all subcarriers are loaded with an 
even number of bits (with the exception of one subcarrier at most), which is consistent with 
Proposition 2. 

[0135] Test case 3 - Adaptive MIMO OFDM based on partial CSL In addition to the 
adaptive MIMO-OFDM based on ID and 2D coder-beamformers, we derive an adaptive 
transmitter that relies on higher-dimensional beamformers on each OFDM subcarrier; we 

term it any-D beamformer here. With BERo = 10^ , we compare non-adaptive transmission 
schemes (that use fixed constellations per OFDM subcarrier) and adaptive MIMO-OFDM 
schemes based on any-D, 2D, and ID beamforming in FIG. 16 with N t = 2,N r = 2, in FIG. 18 
with N t = 4, N r = 2, and in Fig. 8 with N t = 4,N r = 4. The Alamouti codes are used when N t = 
2, and the rate % STBC code is used when N t = 4. The transmission rates for adaptive 
MIMO-OFDM are averaged over 200 feedback realizations. 

[0136] With N t = 2 in FIG. 17, the any-D beamformer reduces to the 2D coder-beamformer, 
since there are at most two basis beams. With N t = 4 in FIGS. 18 and 19, 23 observe that the 
adaptive transmitter based on 2D coder-beamformer achieves almost the same data rate as 
that based on any-D beamformer, for variable quality of the partial CSI (as p varies), and 
various size MIMO channels (as N r varies). Thanks to its reduced complexity, 2D 
beamforming is thus preferred over any-D beamforming. On the other hand, the ID 
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beamforming is considerably inferior to 2D beamforming when low quality CSI is present at 
the transmitter. But as CSI quality increases (e.g., p > 0.9), the transmitter based on ID 
beamforming approaches the performance of that based on 2D beamforming. 
[0137] With N t = 2 9 N r =2 in FIG. 17, the adaptive MIMO-OFDM based on the 2D coder- 
beamformer always outperforms non-adaptive alternatives. With N t = 4, N r = 2 in FIG. 18, 
the non-adaptive transmitter at the low SNR range, with extremely low feedback quality ( p 

= 0). However, as the SNR increases, or, the feedback quality improves, the adaptive 2D 
transmitter outperforms the non-adaptive transmitter considerably. As the number of receive 
antennas increase to N r = 4 in FIG. 19, the adaptive 2D beamforming transmitter is uniformly 
better than the non-adaptive transmitter, regardless of the feedback quality. 
Proofs 

Based on (28) and (12) we have 

c{k,i)=2 2(hl) dZ[k]joT / = 2/-l,2/,and / = 1,2,.... (76) 
Table I lists the required power to load the ith bit on the Ath subcarrier. 



i 


1 


2 


3 


4 


5 




dl[k]lg(i) 




2dl[k] 


6d 2 0 [k] 


\0d 2 0 [k] 


26d 2 [k] 




c(k,i) 


dl[k\ 


d 2 0 [k] 


Adl[k] 


4d 2 0 [k] 


I6d 2 [k] 





TABLE 1 



From Table I and eq. (33), we infer that 

c{k,i = l)>c(k,i\ Vi,k. (77) 
[0138] Although the greedy algorithm chooses always the 1-bit optimum, eq. (77) reveals 
that all future additional bits will cost no less power. This is the key to establishing the 
overall optimality because no matter what the optimal final solution is, the bits on each 
subcarrier can be constructed in a bit-by-bit fashion, with every increment being most power- 
efficient, as in the greedy algorithm. Hence, the greedy algorithm is overall optimal for our 
problem at hand. Lacking an inequality like (77), the optimality has been formally 
established. 

[0139] An important observation from (76) is that c(k, 2j-l) = c(k, 2j) holds true for any k 
and j. Suppose at some intermediate step of the greedy algorithm, the (2/ - l)st bit on the Mi m 
subcarrier is the chosen bit to be loaded, which means that the associated cost c(k, 2j-l) is 
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the minimum out of all possible choices. Notice that c(k, 2j) = c(k, 2j - 1) has exactly the 
same cost, and therefore, after loading the {2j - l)st bit on the Ath subcarrier, the next bit 
chosen by the optimal greedy algorithm must be the (2/)th bit on the same subcarrier, unless 
power insufficiency is declared. So, the overall procedure effectively loads two bits at a 
time: as long as the power is adequate, the greedy algorithm will always load two bits in a 
row to each subcarrier. Let us denote the total number of bits as R square = 2^ £J b x [n;k], 

when using only square QAMs, and R rect = 2]T £jj b 2 [n;k] when allowing also for 
rectangular QAMs. AT most on one subcarrier k\ it holds that b 2 [n\ k' ]= b x [n; It ]+ 1 , which 
has probability 54; while for all other subcarriers, b 2 [n; k] = b x [n; k] + 1 Hence, R sqU are is less 
than R rect by most one bit per space time coded OFDM block. 

Higher Than Two-D Beamforming 

[0140] For practical deployment of the adaptive transmitter, we have advocated the 2D 
coder-beamformer on each OFDM subcarrier. With N t >2 however, higher than 2D coder 
beamformers have been developed. They are formed by concatenating higher dimensional 
orthogonal space-time block coding designs, with properly loaded space time multiplexers. 
Collecting more diversity through multiple basis beams, the optimal A^-dimensional 
beamformer outperforms the 2D coder-beamformer, from the minimum achievable her 
point of view. Hence, with more than two basis beams, the threshold metric per subcarrier 
may improve, and the constellation size on each subcarrier may increase under the same 
performance constraint. However, the main disadvantage of A^-dimensional beamforming is 
that the orthogonal STBC design loses rate when N t > 2. The important issue in this context 
is how much one could lose in adaptive transmission rate by focusing only on the 2D coder- 
beamformer, instead of allowing all possible choices of beamforming that can use up to N t 
basis beams. 

[0141] In the following, we use the notation n t D to denote beamforming with n t "strongest" 
basis beams. With n t < 2, two symbols are transmitted over two time slots as in (2). When n t 
= 3,4, the beamformer can be constructed based on the rate % orthogonal SBC, with three 
symbols transmitted over four time slots. When 5 < n t < 8, the beamformer can be 
constructed based on the rate 14 orthogonal STBC, with four symbols transmitted over eight 



40 



Docket No.: 1008-022US01 / Z04015 



time slots. Let us consider, for simplicity, a maximum of eight directions even when N t > 8, 
i.e., n lmax = min (N h 8). If we take a super block with eight OFDM symbols as the adaptive 
modulation unit, then each super block allows for different n t D beamformers on different 
subcarriers at each modulation adaptation step. Specifically, in one super block, one 
subcarrier could place four 2D coder-beamformers, or, two 4D beamformers, or one 8D 
beamformer, depending on partial CSI. With constellation size M[k], the corresponding 
transmission rate for the n t D beamformer is 8 f log 2 (M [k]j per subcarrier per super block, 

where =1 iorn t = 1,2, =3/4for n t =3,4, and =1/2 for n t =5,6,7,8. 
Furthermore, with power P[k] on each subcarrier, the energy per information symbol is 
d 2 [k] = (l / f )g{b\k\j?\k\ This includes (1 1) as a special case with f\ = $i = 1 . 

[0142] As with 2D beamforming, we wish to maximize the transmission rate of the MIMO- 
OFDM subject to the performance constraint on each subcarrier. We first determine the 
distance threshold d\ 5/J [k] on each subcarrier for the D beamformer, where 1 < n t < n t>max . 

With the average BER expression for the n t D beamformer, we find d\ , [k] through one 

dimensional numerical search. Hence, if the assigned constellation has d 2 [k] > d\ , [k], 

adopting the n t D beamformer will lead to the guaranteed BER performance, thanks to the 
monotonicity we established in our Lemma. 

[0143] Having specified {rf 0 2 , fl( fc]}^ f° r eac h n t e [l>2 v ., n< ^ J, we can also modify our 

greedy algorithm, to obtain the optimal power and bit loading across subcarriers. First we 
define the effective number of bits b e := bf when 2^-QAM is used together with n t D 

beamforming. Second, we constrain the effective number of bits b e to be integers, in order to 
facilitate the problem solving procedure. To achieve this, non-integer QAMs are assumed 
temporarily available for an n t (we will later on quantize them to the closet square or 
rectangular QAMs). This entails a certain approximation error, but our objective here is to 
quantify the difference between 2D beamforming and any n t D beamforming. The greedy 
algorithm can be applied as described, but with each step loading effectively one bit on 
certain subcarrier. Specifically, we need to replace c(k,b e +1) in the original greedy 
algorithm with c(k,b e +1 ), where 



41 



Docket No.: 1008-022US01 / Z04015 



c{k,b e +l) = min 



-min 









(78) 



is the minimal power required to load one additional bit on top of b e effective bits on the Ath 
subcarrier, given that all possible n t D beamformers can be arbitrarily chosen. Notice that the 
optimal beamforming, based on as many as n Umax basis beams, includes 2D beamforming as a 
special case with n Umax = 2. Numerical results demonstrate that the 2D transmitter performs 
close to any higher dimensional one in most practical cases. However, the 2D transmitter 
reduces the complexity considerably, which is the reason why we favor the 2D coder- 
beamformerin practice. 
Conclusion 

[0144] The described MIMO-OFDM transmissions are capable of adapting to partial 
(statistical) channel state information (CSI). Adaptation takes place in three (out of four) 
levels at the transmitter: The power and (QAM) constellation size of the information 
symbols; the power splitting among space-time coded information symbol substreams; and 
the basis-beams of two- (or generally multi-) dimensional beamformers that are used (per 
time slot) to steer the transmission over the flat MIMO subchannels corresponding to each 
subcarrier. 

[0145] For a fixed transmit-power, and a prescribed bit error rate performance per subcarrier, 
we maximize the transmission rate for the proposed transmitter structure over frequency- 
selective MIMO fading channels. The power and bits are judiciously allocated across space 
and subcarriers (frequency), based on partial CSI. Analogous to perfect-CSI-based DMT 
schemes, we established that loading in our partial-CSI-based MIMO OFDM design is 
controlled by a minimum distance parameter (which is analogous to the SNR-threshold used 
in DMT systems) that depends on the prescribed performance, the channel information, and 
its reliability, as those partially (statistically) perceived by the transmitter. This analogy we 
established offers two important implications: i) it unifies existing DMT metrics under the 
umbrella of partial CSI; and ii) it allows application of existing DMT loading algorithms 
from the wireline (perfect CSI) setup to the pragmatic wireless regime, where CSI is most 
often known only partially. 
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[0146] Regardless of the number of transmit antennas, the adaptive two-dimensional coder- 
beamformer should be preferred in practice, over higher-dimensional alternatives, since it 
enables desirable performance-rate-complexity tradeoffs. 

[0147] Various embodiments of the invention have been described. The described 
techniques can be embodied in a variety of transmitters including base stations, cell phones, 
laptop computers, handheld computing devices, personal digital assistants (PDA's), and the 
like. The devices may include a digital signal processor (DSP), field programmable gate 
array (FPGA), application specific integrated circuit (ASIC) or similar hardware, firmware 
and/or software for implementing the techniques. In other words, constellation selectors and 
Eigen-beam- formers, as described herein, may be implemented in such hardware, software, 
firmware, or the like. 

[0148] If implemented in software, a computer readable medium may store computer 
readable instructions, i.e., program code, that can be executed by a processor or DSP to carry 
out one of more of the techniques described above. For example, the computer readable 
medium may comprise random access memory (RAM), read-only memory (ROM), non- 
volatile random access memory (NVRAM), electrically erasable programmable read-only 
memory (EEPROM), flash memory, or the like. The computer readable medium may 
comprise computer readable instructions that when executed in a wireless communication 
device, cause the wireless communication device to carry out one or more of the techniques 
described herein. These and other embodiments are within the scope of the following claims. 
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